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Abstract 


We analyse the (7r)-convergence rate of irreducible and aperiodic Markov chains 
with 7V-band transition probability matrix P and with invariant distribution tt. This 
analysis is heavily based on: first the study of the essential spectral radius ress(P|f 2 (^)) of 
derived from Hennion’s quasi-compactness criteria; second the connection between 
the spectral gap property (SG 2 ) of P on £^(7r) and the V-geometric ergodicity of P. 
Specihcally, (SG 2 ) is shown to hold under the condition 


N 


ao := 


E 

m— — N 


limsup \/ P{i, i + m) P*{i + m, i) 

i —)• +00 


< 1. 


Moreover 'ress(P|f 2 ('^)) < ap. Simple conditions on asymptotic properties of P and of its 
invariant probability distribution tt to ensure that Oq < 1 are given. In particular this 
allows us to obtain estimates of the £^(7r)-geometric convergence rate of random walks 
with bounded increments. The specific case of reversible P is also addressed. Numerical 
bounds on the convergence rate can be provided via a truncation procedure. This is 
illustrated on the Metropolis-Hastings algorithm. 


AMS subject classification : 60J10; 47B07 

Keywords : Rate of convergence, .^^-spectral gap, V-geometric ergodicity. Essential spectral 

radius, Metropolis-Hastings algorithm. 


1 Introduction 

Let P := (R(*, j))(ij)6x2 bs ^ Markov kernel on a countable state space X. For the sake of 
simplicity we suppose that X := N. Throughout the paper we assume that P is irreducible 
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and aperiodic, that P has a unique invariant probability measure denoted by vr := (7r(i))jgN 
(observe that Vz G N, 7r(z) > 0 from irreducibility), and finally that 

3io G N, 3N £ N*, Vz > zo : |z - j| > iV ^ P{i,j) = 0. (ASl) 

We denote by (^^(vr), || • II 2 ) the usual Hilbert space of sequences (/(z))igM S such that 
II/II2 := < 00 . It is well-known that P defines a linear contraction on 

£^(7r), and that its adjoint operator P* on ^^(vr) is defined by P*{i,j) := 7r(j) P(j, z)/7r(z). 
The kernel P is said to have the spectral gap property on ^^(vr) at rate p G (0,1) if there 
exists some positive constants p G (0,1) and C G (0, -t- 00 ) such that 

Vn>l,V/G^ 2 (^)^ ||P”/-n/|| 2 <Cp "||/||2 with n/:=7r(/)lN, (SG 2 ) 

where 7r(/) := Ei>o •^(*) ^(*)' ^ relevant and standard issue is to compute the value (or to 
hnd an upper bound) of 


Q 2 := inf {/9 G (0,1) : (SG 2 ) holds true}. 


( 1 ) 


In this work we use the quasi-compactness criteria of [Hen93] to study (SG 2 ) and to 
estimate Q 2 . In Section 2 it is proved that (SG 2 ) holds when 

N 

do := E limsup sjP{i, i + m) P*{i + m, z) < 1. (AS2) 

7.7- 2 — >■ +00 

m=—N 

Moreover ress{P\e^{Tr)) — ^o- The main argument to obtain this result is the Doeblin-Fortet 
inequality in Lemma 2. We refer to [Hen93] for the definition of the essential spectral radius 
VessiT) (related to quasi-compactness) of a bounded linear operator T on a Banach space. In 
Section 3, under the following assumptions 


Vm = —N,... ,N, P{i, i + m) - > am G [0,1]. 

i —?*• +00 


vr(z -I- 1) 
7r(f) 


g-^ r G [0,1) 

I +00 


N 

^ feOfc < 0 , 

k=-N 


(AS3) 

(AS4) 

(NERI) 


we establish that (AS 2 ) holds (hence (SG 2 )) and that ao can be explicitly computed in 
function of r and the Om’s. Observe that (NERI) means that the expectation of the asymp¬ 
totic random increments is negative. Moreover, using the inequality ressiP\p{TT)) E olq, 
Property (SG 2 ) is proved to be connected to the so-called E-geometric ergodicity of P for 
V := ( 7 r(n)“^/^)„gp^, which corresponds to the spectral gap property on the usual weighted- 
supremum space By associated with V. In particular, denoting the minimal E-geometrical 
ergodic rate by gy, it is proved that, either g2 and gy are both less than oq, or g2 = gy- 
As a result, an accurate bound of ^2 is obtained for random walks (RW) with i.d. bounded 
increments using the results of [HL14b]. In the reversible case (Section 4) the previous results 
hold under Assumptions (ASS) and (AS4) provided that Um 7 ^ CL-m for at least one m. A 
first illustration to Birth-and-Death Markov chains (BDMC) is proposed in Subsection 4.1. 
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The reversible case naturally contains the Markov kernels associated with the Metropolis- 
Hastings (M-H) Algorithm. In Subsection 4.2 we observe that, if the target distribution vr 
and the proposal kernel Q := {Q{i, satisfy (ASl), (ASS) and (AS 4 ), then so is the 

associated reversible M-H kernel P, which then satishes (SG 2 ). 

Estimating Q2 is a difficult but relevant issue. This question is investigated in Section 5 
where an accurate estimation of Q2 is obtained by using the above mentioned link between 
g 2 and qv and by applying the truncation procedure in [HL14a]. Numerical applications to 
discrete MCMC are presented at the end of Section 5. Bounding Q2 in the reversible case is 
of special interest since (SG2) holds in this case with (7=1 and p = Q 2 - 

The spectral gap property for Markov processes has been widely investigated in the discrete 
and continuous-time cases (e.g. see [Ros71] for discrete-time, [Che04] for continuous-time, and 
[CGI3] for dynamical systems). We point out that there exist different dehnitions of the spec¬ 
tral gap property according that we are concerned with discrete or continuous-time case. A 
simple and concise presentation about this difference is proposed in [YueOO, MS 13]. The focus 
of our paper is on the discrete time case. In the reversible case, the equivalence between the 
geometrical ergodicity and (SG2) is proved in [RR97] and Inequality g 2 < gv is obtained 
in [BaxOS, Th.6.1.]. This equivalence fails in the non-reversible case (see [KM12]). The link 
between g2 and gy stated in our Proposition 1 is obtained with no reversibility condition. 
The works [SWll, Wubl2] provide formulae for ^2 in terms of isoperimetric constants which 
are related to P in reversible case and to P and P* in non-reversible case. However, to the 
best of our knowledge, no explicit value (or upper bounds) of g2 can be derived from these 
formulae for discrete Markov chains with band transition matrices. For instance (SG2) is 
proved to hold in [Wubl2] for RW with i.d. bounded increments satisfying (NERI) and a 
weak reversibility condition, but no explicit bounds for g 2 are derived from isoperimetric con¬ 
stants. For such RWs, our method gives the exact value of g2 with no reversibility assumption 
(see Examples 1 and 2). Concerning BDMCs, recall that the decay parameter of P, which 
equals to g2 for these models (see [vDS95]), is only known for specihc instances of BDMC 
(see Remark 3 for details). In the context of discrete MCMC, no satisfactory bound for g2 
was known to the best of our knowledge, except for special instances as the simulation of a 
geometric distribution corresponding to a simple BDMC (see [MT96, Ex. 2]). The bounds 
for g2 obtained in Section 5 for discrete MCMC via truncation procedure applies to any tar¬ 
get distribution vr satisfying (AS 4 ) when the proposal kernel Q satishes (ASl) and (ASS). 
The accuracy of our estimation in Section 5 depends on the order k of the used truncated 
hnite matrix Pk (see Tables 2 and 3). Our explicit bound ress{P\p(-K)) < oo in Theorem 1 
for discrete Markov chains with band transition matrices is the preliminary key results in 
this work. Recall that r(,ss{P\(P(iT)) is n natural lower bound of g 2 (see [HL14b, Prop. 2.1] 
with ^^(vr) in place of By)- The essential spectral radius of Markov operators on a L^-type 
space is investigated for discrete-time Markov chains with general state space in [Wu04] (see 
also [GW06]), but no explicit bound for re,ss{P\p(iT)) can be derived a priori from these the¬ 
oretical results for discrete Markov chains with band transition matrices, except Inequality 
r(,ss{P\p{TT)) < ^ess{P\Bv) i'^ reversible case (see [Wu04, Th. 5.5.]). Finally recall that, 
for any Markov chain (A„)„gN with transition kernel P satisfying (SG2), the Berry-Esseen 
theorem and the hrst-order Edgeworth expansion apply to additive functional of (A„)„gN 
under the expected third-order moment condition, see [FHL12]. 
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2 (SG 2 ) under Assumption (ASl) on P 

Theorem 1 If Condition (AS2) holds, then P satisfies (SG 2 ). Moreover < oq- 

Proof. Let denote the usual Banach space of sequences (/(i))iGN G satisfying the 

following condition: ||/||i := |/(i)| 7r(i) < 00 . 

Lemma 1 The identity map is compact from £'^p) into 

Lemma 2 For any a > ao, there exists a positive constant L = L{a) such that 

||^/||2< a ||/||2 + L ||/|| i . 


It follows from these lemmas and from [Hen93] that P is quasi-compact on Pp) with 
ress(P|£2(jr)) < cr- Since a can be chosen arbitrarily close to oq, this gives ress{P\i2(^Tr)) — Q^o- 
Then (SG 2 ) is deduced from aperiodicity and irreducibility assumptions. □ 

Lemma 1 follows from the Cantor diagonal procedure. 

Proof of Lemma 2. Under Assumption (ASl) we define 


Vi > io, Vm = -A,..., A, fimp) '■= \/ P(i, iPm)P*{iP m, i). 


( 2 ) 


Let a > ao, with ao given in (AS2). Fix £ = £{a) > io such that supj>£/3m(i) < a. 

For / G £^(p we have from Minkowski’s inequality and the band structure of P for i > £ 


ll^/l|2 < 


1 1/2 


^ 2 <£ 


x:i(p/)(i)iG(i) + E E P(i,i + m)f(i + m} 


N 


i<l 


i>l m=—N 
N 


2 1 1/2 

7r(i) 


< ax:i(p/)wkw+ E E P{i,i + m) f{i + m) 


i>l m=—N 


2 1 1/2 

7r(i) 


where (7/ > 0 is derived from equivalent norms on the space C^. Note that I (P f)(i) \ vrfz) < 
||P/||i < ll/lli so that setting L := Ci 


\\Pfh < L\\f\\i + 


N 


E E P{i i P m) f(i + m) 


i>i m=—N 


2 1 1/2 

7r(i) 


(3) 


It remains to obtain the expected control of the second terms in the right hand side of (3). 
For m = —A,..., A, let us define Fm = (Pm(i))iGN £ £‘^P) by 


Fm{i) ■= 


0 


if i < £ 


P{i, i + m) f{i + m) if i> £. 
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Then 


N 


E E P{i, i + m) f{i + m) 


i>£ m=—N 
N 


2 1 1/2 
Tr{i) 


E 


m II2 


m=—N 


N 


m=—N ^ i>£ 

7r(i) P{i, i + m) 


m=—N 

N 


1/2 




m) 


m=—N i>i 
N 


< 


< 


{supI3m{i)) + 


m=-N 


N 


'^i+m 


^J| '^2+m 


|/(i + m)\^Tri+m 


1/2 


(from the definition of P* 


i>£ 


1/2 


(from (2)) 


Y sup/3m (0 

AT i>^ 

n=—N — 

The statement in Lemma 2 can be deduced from the previous inequality and from (3). □ 


3 (SG 2 ) and geometric ergodicity. Application to RWs with 

i.d. bounded increments 


We specify Theorem 1 in terms of y—geometric ergodicity for V := {Tr{n)~^^'^)nen- Let 
{Bvt II ■ llv) denote the weighted-supremum space of sequences {g{n))n^^ G such that 
ll^lly := sup^gj^ F(n)“^ lfl'(^)l < oo- Recall that P is said to be R-geometrically ergodic if P 
satishes the spectral gap property on By, namely: there exists C G (0,+ 00 ) and p G (0,1) 
such that 

Vn>l,V/G^u, ||P"/-n/||y <Cp"||/||y. (SGy) 

When this property holds, we dehne 

QV '■= inf{/9 G (0,1) : (SGy) holds true}. (4) 


Remark 1 Under Assumptions (ASS) and (AS4), we have 


N 


do : = 


lim sup a/P{ i, i + m) P*{i + 


m,i] = 


m=—N 


i —>■ +00 


N 


Yj 0,rnT if T e (0, 1) 


m=—N 

Oo 


(5) 


if T = 0, 


Moreover, if t = 0 in (AS4), then Om = 0 for every m = . ,N. 


Indeed, if (AS4) holds withr G (0,1), then the claimed formula follows from the definition 
of P*{■,■). If T = 0, then Om = 0 for every m > 0 since the invariance of tt gives 


N / . N 

7r{i) 


m=—N 


( 6 ) 
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so that the sequence (P(i + m, i)7r(i + m)/7r(i))^ must be hounded for each m < 0. Now observe 
that we have for m < 0 


\/ P{i, i + m) P*{i + m, i) = P{i, i + m)\ 
Next, setting i = i + m, we obtain for m > 0 


7r{i) 


7r{i + m) 


0 when i —)• +oo. 


P{i, i + m)P*{i + m, i) = P{i - 

V 


= P{i-m,i) 


'Kil — m) / 7r(£) 


7r(£) Y 

since we know that [P{£ — m,l) 'Kil — m)/7r(^))^ is bounded. Hence ag = Oo- 


0 when i —>■ +oo 


Proposition 1 If P and k satisfy Assumptions (ASS), (AS4) and (NERI), then P satis¬ 
fies (AS2) (and og < 1 with og given in (5)). Moreover P satisfies both (SG 2 ) and (SGy), 
we have max{ress{P^i^^),ressiP\e‘ 2 (Tr))) < <^o> the following assertions hold: 

(a) if gv < oo; then Q 2 < og; 

(h) if Qv > og, then Q 2 = Qv- 


Proof. If r = 0 in (AS4), then og = og < 1 from (5) and (NERI). Now assume that (AS4) 
holds with r G (0,1). Then og = Ylm=-N = '^(V^)) where: Vt > 0, ^/’(t) := 

'l^k=-N ■ Moreover it easily follows from the invariance of k that 'iP{t) = 1. Inequality 

og = 'if{y/T) < 1 then follows from the following assertions: Vt G (r, 1), ifft) < 1 and 
Vt G (0,r) U (l,+oo), 'ip{t) > 1. To prove these properties, note that '(({r) = V’(l) = 1 
and that is convex on (0,+ 00 ) since the second derivative of 'ip is positive on (0,+oo). 
Moreover we have lim^^+oo'^(f) = +00 since > 0 for some k < 0 (use '(’(t) = V'(l) = 1 
and r G (0,1)). Similarly, limi_j,g+ 'ip{t) = +00 since > 0 for some A: > 0. This gives the 
desired properties on ip since ip'il) > 0 from (NERI). 

(SG 2 ) and ress{P\i‘2{'K)) — <^0 follow from Theorem 1. Next (SGy) is deduced from the 
well-known link (see [MT93]) between geometric ergodicity and the following drift inequality: 

Va G (ag, 1), 3L = > 0, PV < aV P L Ijsf. (7) 


This inequality holds from 


{py)ii) 

V{i) 


N 

P{i, i + m) 

m=—N 


T^{i) A " 

K{i + m) J 


-)■ ag. 

i —>• +00 


This gives (7), from which (SGy) is derived using aperiodicity and irreducibility. It also 
follows from (7) that ress{P\Bv) — ^ [HL14b, Prop. 3.1]). Thus ress{P\Bv) — 
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Now we prove (a) and (b) using the spectral properties of [HL14b, Prop. 2.1] of both 
P|£ 2 ( 7 r) and P\By (due to quasi-compactness). We will also use the following obvious inclusion: 
^^(vr) C Bv- In particular every eigenvalue of is also an eigenvalue for P\By- First 

assume that gy < cto- Then there is no eigenvalue for P\By i'^ iFe annulus F := {A G C : ao < 
|A| < 1} since ress(i^B^^) < ao. From ^^(vr) C By it follows that there is also no eigenvalue for 
P|£ 2 (.n-) in this annulus. Hence p 2 < ao since ress{P\£2(TT)) — ®o- Second assume that gy > oq. 
Then P\By admits an eigenvalue A G C such that |A| = gy. Let / G By, f ^ 0, such that 
Pf = A/. We know from [HL14b, Prop. 2.2] that there exists some /3 = I3\ € (0,1) such that 
|/(n)| = 0{V{n)^) = 0 ( 7 r(n)“^/^), so that |/(n)p 7 r(n) = 0 ( 7 r(n)d“^i), thus / G ■^^(vr) from 
(AS4). We have proved that g 2 > gy. Finally the converse inequality is true since every 
eigenvalue of T]£ 2 (jr) is an eigenvalue for P\By- Thus g 2 = gv- Tl 


Example 1 (RWs with i.d. bounded increments) Let P be defined as follows. There 
exist some positive integers c,g,d G N* such that 


Vz G {0, ...,5-1}, '^P{i,j) = l-, 

j=0 

aj_i if i - g < j <i + d 
0 otherwise. 

d 

{a-g ,..., Ud) G [0, ; a.g > 0, > 0, = 1. 

k=-g 


Vz>5',ViGN, P{i,j) = 


( 8 a) 

( 8 b) 

( 8 c) 


ITe assume that P is aperiodic and irreducible, and that Assumtion (NERI) holds, that is: 
'^t=-g ^ < 0. Then P admits a unique invariant distribution tt, and the conclusions 
of Proposition 1 hold. Moreover it can be derived from standard results of linear difference 
equation that Tr{n) ~ cr"" when n^+oo, with r G (0,1) defined by if{T) = 1, where fii{t) := 
Y.k=-N ■ Thus, if-f := then By = {(5(n))neN G C^, sup^gp, 7 “” | 5 (n)| < 00 }. 

Then we know from [HLlfb, Prop. 3.2] that res<j(T’|g^) = ao with ao given in (5), and that 
gy can be computed from an algebraic polynomial elimination. More precisely, the procedure 
in [HL14bj developed for a special value 7 can be applied for 7 := by considering V := 

{A G C : z/:(yT) < |A| < 1}. When Assertion (b) of Proposition 1 applies, we obtain the exact 
value of g 2 (see Example 2). Property (SG 2 ) is proved in fWubl2, Th. 2] under an extra weak 
reversibility assumption (with no explicit bound on g 2 ). However, except in case g = d = 1 
where reversibility is automatic, a RW with i.d. hounded increments is not reversible or even 
weak reversible in general. Note that no reversibility condition is required in Proposition 1. 


Example 2 (Numerical examples in case g = 2 and d = 1) Let P be defined by 

P(0,0) = a G (0,1), P(0,l) = l-a, P(l,0) = 6 G (0,1), P(l,2) = l-6 (9) 

Vn > 2, P{n, n - 2) = 1/2, P(n, n - 1) = 1/3, P{n, n) = 0, P{n, n + l) = l/6. (10) 

The form of boundary probabilities in (9) and the special values in (10) are chosen for con¬ 

venience. Other (finitely many) boundary probabilities in (9) and other values in (10) could 
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be considered provided that P is irreducible and aperiodic and that (a_ 2 , 0 -i,aoj®i) satis¬ 
fies a_ 2 ,ai > 0 and (NERI) i.e. Oi < 2 a _2 + fl-i- Here the fonction ijj is given by: 
ip{t) := t^l2 + t/3 + l/ 6 t = 1 + (i — l)(i^ — 5t/3 — \l‘i)l2t. Then function if{-) — 1 has a 
unique zero over (0,1) which is t = — 5)/6 ~ 0.1805 and oq = 'ip{y/T) ~ 0.6242. Let 

7 := y/r ~ 2.3540 and V := ( 7 "')nGN- Using the procedure from [HLl^b] and Proposition 1, 
we give in Table 1 the values of oq, qv and Q 2 for this instance. 


{a,b) 

ao 

PV 

Q2 

( 1 / 2 , 1 / 2 ) 

0.624 

0.624 

< 0.624 

( 1 / 10 , 1 / 10 ) 

0.624 

0.688 

0.688 

(1/50,1/50) 

0.624 

0.757 

0.757 


Table 1: Convergence rate on ^^(vr) for different boundary transition probabilities {a,b) 


Remark 2 If (AS4) in Proposition 1 is reinforced by the condition 7 r(n) ~ cr"' when 
n—T'+oo with T € ( 0 , 1 ) (e.g. see Example 1), then let us consider By = {ig{'n))nen S 
C^, sup^gj^ 7 “"’| 5 (n)| < oo} with 7 := Then we deduce from [HL14b, Prop. 3.2] 

that r(,ss{P\Bv) ~ *^0 with ao given in (5), so that gy < ckq implies that gy = since 
Qv P 'i"essiP\Bv)■ Then it follows from Proposition 1 that g 2 < gy and that this inequality is 
an equality when gy > ao- The passage from (SGy) to (SG 2 ) and the inequality g 2 < gy was 
established in [RR97, BaxOS] for general reversible V-geometrically ergodic Markov kernels. 
Again note that no reversibility condition is assumed in Proposition 1. 


4 Applications to the reversible case 


The reversible case corresponds to the condition P = P* (i.e. P is self-adjoint in £'^{'ir)), 
namely: '7{i,j) G Tr{i) P{i,j) = TT{j) P{j,i) (detailed balance condition). Then (SG 2 ) is 
equivalent to the condition p 2 = ||-P~n||2 < 1 , where || • II2 denotes here the operator norm 
on ^^(vr). Thus, when (SG 2 ) holds in the reversible case, we have C = 1 and p = g 2 , that is 

Vn > 1 , V/ G ei-K), IIP-/ - 7r(/)l||2 < ^? 2 " II/II 2 . (11) 


Corollary 1 If P is reversible, then: 


1. P satisfies (SG 2 ) and res<j(T|£ 2 (.„.)) < olq under (AS2), with: 


ao := E lim sup -s/ P{i, i + m) P{i + m, i) | 

\ 2 ^ +00 / 

m=—N ^ ^ 


< 1. 














2. If Condition (ASS) holds true, then 


N 



( 12 ) 


m=l 


Consequently, if Om 7^ o--m for at least one m G {1,..., A^}, then P satisfies (AS2). 

3. If P satisfies (ASS) and if vr satisfies (AS4) with r G (0,1), then am 7 ^ O-m provided 
that Om 7^ 0. Consequently, if Om 7^ 0 for some \m\ G {1,... , N}, then the eonclusions of 
Proposition 1 hold with aQ given in (12). 


4- If P satisfies (ASS) with oq < 1 and if tt satisfies (AS4) with r = 0, then the conclusions 


of Proposition 1 hold. 

- Dans le corollaire 1, je mettrai : Consequently, if Om 7^ 0 for \m\ G {1,... ,N} car vaut 
aussi pour les m < 0 

Keep in mind that all our results are stated for positive recurrent Markov kernels. For 
instance, for Markov chain associated with P{i, i — 1) ■= p, P{i, i) ■= r, P{i, i + 1) := q where 
p + r + q = 1, Formula (12) is ao = 1 — {^/q — ^/p)^, but the existence of vr is only guaranteed 
when p > q. 

Proof. The first statement follows from Theorem 1 and reversibility. Next (ASS) gives 
ao = '^m=-N s/o^m O-m, hence Assertion 2. since Ylm=-N = 1- If moreover (AS4) holds 
with r G (0,1), then Om 7^ «-m for every m G {1,..., A^} since r™ a_m = Um from the balance 
condition. Thus, under (ASS) and (AS4) with r G (0,1), we obtain from Assertion 2. that 
ao < 1. Moreover, since the real numbers ao given in (12) and in (5) are equal, all the spectral 
properties obtained in Proposition 1 remain valid. Idem for Assertion 4. from Remark 1. □ 


4.1 Birth-and-Death Markov chains (BDMC) 

The transition kernel P := {Pii, of a Birth-and-Death Markov chains is defined by 


fro qo 0 . \ 


Pi ri qi 


(13) 


P : = 


0 P2 1-2 q2 


\- ■ 

Recall that, under the following conditions 


/ 


OO 


ro < 1, Vi > 1, 0 < qi,pi < 1 



(14) 


P is irreducible, aperiodic and tt (unique) is given by: 7r(0) = 1/S, 7r(i) = 

Moreover it is well-known that P is reversible w.r.t. tt. Finally Condition (AS2) writes as: 


ao := lim sup y/pfq/Il + limsupr* -|- limsup ^qiPi+i < 1. 


(15) 


9 









Consequently, under Conditions (14) and (15), P satisfies (SG 2 ) and ress(-P|£ 2 ( 7 r)) — ®0- In 
particular, if the sequences (pj)jgN*) (^i)*GN nnd ((?i)igN in (13) admit a limit when i^+ 00 , 
say p, r, q, then (SG 2 ) holds provided that p > q. Moreover ress{P\£‘2{n)) < 1 “ {^/P — ■ 

Example 3 (State-independent BDMC) 

Let P given by (13) such that, for any i > 1, pi := p, ri : 
that p + r + q = 1 and p > q > 0. Let tq € (0,1) and /3o 
with V{n) := {pfq)"'^'^ can be derived from [HL14b, Prop 

• ifroG [(do, 1), then Q 2 < r + 2ylpq; 

• ifroe (0,/3o], then : 

(a) in case 2p < (l — g -|- ^/pqY ■ Q 2 < r + 2y/pq; 

(b) in case 2p > (^1 — q + y/pqY, setting (di := p — ^/pq — ^Jr{r + 2.^fpq) 

I p(l — rn) I 

Q2 = \rod -—— w/ien ro e (0,/3i] 

ro-l + q' 

Q 2 <r + 2y/^ whenro € [ldi,ldo). 

Remark 3 (Discussion on the £^(7r)-spectral gap and the decay parameter) 

Let P be a BDMC satisfying (If)- P can be proved that the decay parameter of P, denoted 
by 7 in [vDS95j but by 'Jds here to avoid confusion, equals to Q2, that is (from reversibility) : 
Ids = Q2 = ||-P~n||2. But note that 'Jds is only known for specific instances of BDMC 
from [vDS95] (see [KovlOj for a recent contribution). For a general Markov kernel P, we only 
have (see also fPop77, Isa79]) jds ^ Q 2 - In particular, the decay parameter does not provide 
information on non-reversible RWs with i.d. bounded increments of Section 3. 


(16a) 

(16b) 


:= r, qi ;= q, with p,q,r £ [0,1] such 
:= 1 — q — y/pq. The bounds for gy 
. 4-1], so that (Corollary 1): 


4.2 The Metropolis-Hastings Algorithm 

Let TT = (7r(i))jgj^ (target distribution) be a probability measure on N known up to a multi¬ 
plicative constant. Let Q := {Q{i,j))[ij )^^‘2 (proposal kernel) be any transition kernel on N. 
The associated Metropolis-Hastings (M-H) Markov kernel P := {P{i,j))[ij )£^‘2 is defined by 

pUj) ;= / ““ if ^ ^ 

I if^ = j- 

It is well-known that P is reversible with respect to tt and that vr is P-invariant. 

Corollary 2 Assume that 7r(i) > 0 for every f E N and that vr satisfies (AS4) with r € (0,1). 
Assume that Q is an aperiodic and irreducible Markov kernel on N such that for every {i,j) E 
Q{i,j) = 0 77 Q{j,i) = 0, satisfying (ASl) and the following condition (see (AS3)J 

Vm = —A, ...,A, gm := . li™ Q{i,i + m). (17) 
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Finally assume that [qk,q-k) 7^ (0,0) for some k S Then the associated M-H 

kernel P satisfies (SG 2 ) and ress{P\e'^{iT)) < oo with 

N 

ao := 1 - ^ (v^ - ^/P-m) where pk : = 

m=l 

If (AS4) holds with t = 0, then p^ = 0 for every m = 1,... ,N, and the above conclusions 
holds true with oq ■ = Pq provided that pQ <1. 

Proof. It is well-known that P is irreducible and aperiodic under the basic assumptions on 
Q. If Q satisfies (ASl) for some N, then so is P (with the same N). Assumption (ASS) 
holds for P: limj_,.+oo * + w-) = Pm with pm defined in (18). Then apply Corollary 1. □ 

Example 4 Assume that vr (possibly known up to a multiplicative constant) is such that 
7r{i) > 0 for every z G N and satisfies (AS4). Let Q be a transition kernel on N satisfying 

(5(0, 0) := r < 1, (5(0, 1) := 1-r, Vz > 1, Q{i,i-1) = q, Q{i,i) = l-2q, Q{i,i+1) = q 

for some q G (0,1/2]. The associated M-H Markov kernel P'^^l is given by P^®)(0,1) = 
min(l — r , g7r(l)/7r(0)) and 

Vz > 1, (z, z — 1) = <7 min ( 1, P^^'^ {ifi + 1) = q min ( 1, 

V 7r(z) ) \ 7r(z) 

P('?)(i,z) := 1-J^p('?)(z,£). 

The conditions of Corollary 2 are trivially satisfied. Then P^^^ satisfies (SG 2 ). Next oq = 
ao{q) in (18) is 


min [qk , q_k) if k ^ 0 
1 - {Pi + P-e) 'ifk = 0. 


aoiq) = 1 - Vt)‘^ (19) 

since the Pm’s in (18) are given by p_i = q, Po = l — q — qr, pi = qr. When q G (0,1/2], ao((7) 
is minimal for q = 1/2, thus q = 1/2 provides the minimal bound for ress{P^£ 2 (^^'j) The relevant 
question is to find q G (0,1/2] providing the minimal value of Q 2 = Q 2 {q) (See Example 6). 


Example 5 (Simulation of Poisson distribution with parameter 1) Letvr be the Pois¬ 
son distribution with parameter A := 1, defined by 7r(z) := exp(—l)/z!. Then (AS4) holds 
with r = 0. Introduce the proposal kernel Q of Example 4 with r ;= 1/2 and q := 1/2. The 
associated M-H kernel P is given by P^^\0,0) = P^®^(0,1) = 1/2 and 


Vz > 1, P('')(z,z-1) = i P^‘i\i,i) 


i 

2{i + iy 


p(‘')(z,z + l) 


1 

2(i + !)■ 


We know from Example 4 that P^Ti satisfies (SG 2 ) and ressiP^i^T^-^) < c^o = 1/2- The rate of 
convergence Q 2 = Q 2 {q) of P'^'h) {g studied in Example 7. 
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5 Bound for Q 2 via truncation and numerical applications 


Let us consider the following k-t\i truncated (and augmented) matrix associated with P: 

,/ X r /X f-P(Lj) if 0 < i < A: — 1 and 0<j<k — 2 

^ ^ ^ ^ ^ ^ \Ee>k-iPih^) if 0<i< fc-l and j = A;-l. 

Let (7(Pk) denote the set of eigenvalues of P^, and define ph '■= max||A|, A G a{Pk), |A| < l}. 


Recall that V{i) := and that gy is defined in (4). The statement below follows 

from Proposition 1 and from the weak perturbation method in [HL14a] applied to P\By, for 
which the drift inequality (7) plays an important role. 


Proposition 2 If P satisfies (ASS), (AS4) and (NERI), then the following properties holds 
with ao given in (5): 


(o.) i?2 < CkQ QV < oiQ, o-nd in this case we have limsup^ pk < oo/ 

(b) Q2> oq QV > olQ) this case we have Q2 = Qv = hm^ p^. 


Below the estimation of the convergence rate Q 2 for some Metropolis-Hastings Markov 
kernel P is derived from Proposition 2. Recall that Inequality (11) applies when P is reversible. 
The generic procedure for the following instances of Markov kernel P is as follows: 


1. Compute oo given in (18) and choose a small e > 0 

2. k:=2 

3. Consider the /c-order truncated matrix P^ of the kernel P. 

4. Compute the second highest eigenvalue pk of Pk- 

5. If \pk — Pk-i\ > £ then (k := k + 1, return to step 3) 
else if Pk > ao then ^2 — Pk else Q 2 < ao- 

It is clear that the control of the stabilization of the sequence {pk)k >2 through the comparison 
between \pk — Pk-i\ and e only provides an estimation of Q 2 - 

Example 6 (Example 4 continued) 

Let us consider the probability distribution vr given by 7r(i) := C {i + 1) r* for n G N where C 
is a (possibly unknown) normalisation constant and 0 < r < 1. Then (AS4) is satisfied. If 
we choose an RW as in Example 4 for the proposal kernel, the associated M-H kernel is 
defined by P('?)(0,1) = min (1 — r , 2 <7 r) and 

Vi > 1, i — 1) = gmin ^1, — i + 1 ) = 5 min ^ 1 , r 

P('')(i,i) := l-^p('')(i,^). 


12 





r = 0.2 

T = 0.5 

q 

ao(q, r) 

pfc(q) 

q2(q) 

Qo(q, t) 

Pfc(q) 

q2(q) 

0.1 

0.9694 

P27 — 0.9710 

~ 0.9710 

0.9914 

P3g ~ 0.9921 

~ 0.9921 

0.2 

0.9389 

P30 — 0.9421 

~ 0.9421 

0.9828 

P44 ~ 0.9842 

~ 0.9842 

0.3 

0.9083 

P31 ~ 0.9131 

~ 0.9131 

0.9743 

P47 ~ 0.9763 

~ 0.9763 

0.4 

0.8778 

P32 ~ 0.8842 

~ 0.8842 

0.9657 

P50 ~ 0.9684 

~ 0.9684 

0.5 

0.8472 

P33 ^ 0.8552 

~ 0.8552 

0.9571 

P51 ~ 0.9605 

~ 0.9605 


T = 0.6 

T = 0.8 

q 

«o(q, t) 

pfc(q) 

Q2{q) 

Q:o(q, r) 

pk{q) 

q2(q) 

0.1 

0.9949 

P44 ~ 0.9953 

~ 0.9953 

0.99889 

P55 ~ 0.99883 

< 0.99889 

0.2 

0.9898 

P51 ~ 0.9906 

~ 0.9906 

0.99777 

Pqq ~ 0.99781 

~ 0.99781 

0.3 

0.9848 

P55 ~ 0.9860 

~ 0.9860 

0.99666 

Pj3 ~ 0.9968 

~ 0.9968 

0.4 

0.9797 

pss ~ 0.9814 

~ 0.9814 

0.99554 

PYg ~ 0.99579 

~ 0.99579 

0.5 

0.9746 

P60 - 0.9767 

~ 0.9767 

0.99443 

pg3 ~ 0.9948 

~ 0.9948 


Table 2: Results for different values of r with e = W The second eigenvalue pk = PkiQ) of 
Pk = is obtained from the observed empirical stabilization of pk with respect to k. 


For q E (0,1/2], satisfies (SG2) with ress{Plp^^)) < ao{q) = I - q{l - (see (19)). 

Table 2 based on Proposition 2 gives the estimate of Q 2 {q) of 


Example 7 (Example 5 continued) Table 3 based on Proposition 2 gives the estimation 
of g 2 {q) of the M-p[ P^^^l used in the simulation of the Poisson distribution of Example 5. 

Note that q := 1/2 gives the smallest value of ressiP^£ 2 (^^'^) = 00(9) = 0-5, with oq given by 

(19). However q := 1/2 does not provide the minimal rate of convergence in £^{'K)-norm (or 
in By-norm). More precisely, for q = 1/2, the kernel admits some eigenvalues in the 
annulus T := {A G K ; 0.5 < |A| < 1}, among which Q 2 {q) ~ 0.8090 is the larger one in 
absolute value. Actually the minimal rate of convergence is achieved at qK, 0.38 and note that 
every value 0.2 < q < 0.5 in Table 3 provides a minimal rate than for q := 1/2. It could be 

conjectured from numerical evidence that for q < qo with qo ~ 0.35, Q 2 = O(o{q). 
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Q 

ao(q) = 

pkiq) 

Q2iq) 

0.1 

0.9 

P37 ~ 0.9003 

~ 0.9003 

0.2 

0.8 

P83 ~ 0.8008 

~ 0.8008 

0.3 

0.7 

P151 ^ 0.7015 

~ 0.7015 

0.38 

0.62 

Pqi ~ 0.6301 

~ 0.6301 

0.4 

0.6 

piY ~ 0.6568 

~ 0.6568 

0.5 

0.5 

pu ~ 0.8090 

~ 0.8090 


Table 3: pk = Pkio) is obtained from the observed empirical stabilization of pk with e = 10 ^ 
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